Learning Generalized Policies in Planning Using Concept Languages
نویسندگان
چکیده
In this paper we are concerned with the problem of learning how to solve planning problems in one domain given a number of solved instances. This problem is formulated as the problem of inferring a function that operates over all instances in the domain and maps states and goals into actions. We call such functions generalized policies and the question that we address is how to learn suitable representations of generalized policies from data. This question has been addressed recently by Roni Khardon 16]. Khardon represents generalized policies using an ordered list of existentially quantiied rules that are inferred from a training set using a version of Rivest's learning algorithm 22]. Here, we follow Khardon's approach but represent generalized policies in a diierent way using a concept language. We show through a number of experiments in the blocks-world that the concept language yields a better policy using a smaller set of examples and no background knowledge. The policy representation is related to the indexical-functional representations advocated by Agre and Chap-man 1] and domain concepts such as`the-next-needed-block' andà-well-placed-block' are identiied from scratch.
منابع مشابه
Development and Validation of an Instrument to Evaluate English Language Teachers' Lesson Planning Self-concept
This study aimed to develop and validate an instrument to evaluate English language teachers’ lesson plan- ning self-concept. To this end, 30 English teachers were asked to prepare a sample lesson plan and 15 of them were invited to participate in a semi-structured interview. A tentative questionnaire including six fac- tors namely: classroom management, lesson planning conformity, planning eff...
متن کاملUtilizing Generalized Learning Automata for Finding Optimal Policies in MMDPs
Multi agent Markov decision processes (MMDPs), as the generalization of Markov decision processes to the multi agent case, have long been used for modeling multi agent system and are used as a suitable framework for Multi agent Reinforcement Learning. In this paper, a generalized learning automata based algorithm for finding optimal policies in MMDP is proposed. In the proposed algorithm, MMDP ...
متن کاملThe Effect of Concept Mapping on Iranian EFL Learners’ Vocabulary Learning and Strategy Use
This study aimed to investigate the effects of concept mapping on the extent to which Iranian EFL learners retain new vocabularies and the degree of awareness toward vocabulary learning strategies they tended to use. To this end, a total of 40 Iranian EFL students were asked to participate in this study. They were randomly assigned to two equal groups; namely, experimental and control. The part...
متن کاملApproximately Optimal Teaching of Approximately Optimal Learners
We propose a method of generating teaching policies for use in intelligent tutoring systems (ITS) for concept learning tasks [37], e.g., teaching students the meanings of words by showing images that exemplify their meanings à la Rosetta Stone [30] and Duo Lingo [13]. The approach is grounded in control theory and capitalizes on recent work by [28], [29] that frames the “teaching” problem as th...
متن کاملMinority Language Policy and Planning in the Micro Context of the City: The Case of Manchester
This paper investigates service provisions in community languages offered by Manchester City Council and agencies working alongside to find out whether there is an explicit language policy in Manchester, how such a policy is formulated, how it functions, and how it is reflected in education. Data was collected through interviews with different personnel in MCC, focus group discussions with comm...
متن کامل